Graph Cut Algorithms for Binocular Stereo with Occlusions

نویسندگان

  • Vladimir Kolmogorov
  • Ramin Zabih
چکیده

Most binocular stereo algorithms assume that all scene elements are visible from both cameras. Scene elements that are visible from only one camera, known as occlusions, pose an important challenge for stereo. Occlusions are important for segmentation, because they appear near discontinuities. However, stereo algorithms tend to ignore occlusions because of their difficulty. One reason is that occlusions require the input images to be treated symmetrically, which complicates the problem formulation. Worse, certain depth maps imply physically impossible scene configurations, and must be excluded from the output. In this chapter we approach the problem of binocular stereo with occlusions from an energy minimization viewpoint. We begin by reviewing traditional stereo methods that do not handle occlusions. If occlusions are ignored, it is easy to formulate the stereo problem as a pixel labeling problem, which leads to an energy function that is common in early vision. This kind of energy function can be minimized using graph cuts, which is a combinatorial optimization technique that has proven to be very effective for low-level vision problems. Motivated by this, we have designed two graph cut stereo algorithms that are designed to handle occlusions. These algorithms produce promising experimental results on real data with ground truth. 1 Traditional stereo methods Computing stereo depth is a traditional problem in computer vision, and has been the focus of a great deal of work (see [5, 17] for recent surveys). Given a pair of images taken at the same time, two pixels are said to correspond if they show the same scene element. The goal of stereo is to compute correspondences between pixels, which then determines depth. The binocular stereo problem is typically formulated as follows: For every pixel in one image, find the corresponding pixel in the other image. We will refer to this as the traditional stereo problem. The problem formulation above has many advantages. It easily fits within a class of problems that arise in early vision called pixel labeling problems, 2 Vladimir Kolmogorov, Ramin Zabih where the goal is to assign each pixel p = (px, py) ∈ P a label from some set L. The label set L depends upon the particular problem; for example, in image denoising, L is intensities. In stereo, L consists of disparities. Pixel labeling problems have been widely studied in computer vision. The problem is naturally formulated in terms of energy minimization, where the goal is to find the labeling f = (f1, . . . , fp, . . . , f|P|) that minimizes

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Segmentation-based Dense Stereo from Quasi-dense Matching

We propose a segmentation-based dense stereo algorithm within an energy minimization framework. The cost function includes a new consistency term to take into account an initial quasi-dense disparity map and handles occlusions explicitly. Based on quasi-dense matching and color segmentation, optimization is performed efficiently by assuming a constant disparity for each region. The assumption i...

متن کامل

Computing Visual Correspondence with Occlusions using Graph Cuts

Several new algorithms for visual correspondence based on graph cuts [7, 14, 17] have recently been developed. While these methods give very strong results in practice, they do not handle occlusions properly. Specifically, they treat the two input images asymmetrically, and they do not ensure that a pixel corresponds to at most one pixel in the other image. In this paper, we present a new metho...

متن کامل

Computing Visual Correspondence with Occlusions via Graph Cuts

Several new algorithms for visual correspondence based on graph cuts [6, 13, 16] have recently been developed. While these methods give very strong results in practice, they do not handle occlusions properly. Specifically, they treat the two input images asymmetrically, and they do not ensure that a pixel corresponds to at most one pixel in the other image. In this paper, we present two new met...

متن کامل

Comparison of Energy Minimization Algorithms for Highly Connected Graphs

Algorithms for discrete energy minimization play a fundamental role for low-level vision. Known techniques include graph cuts, belief propagation (BP) and recently introduced tree-reweighted message passing (TRW). So far, the standard benchmark for their comparison has been a 4-connected grid-graph arising in pixel-labelling stereo. This minimization problem, however, has been largely solved: r...

متن کامل

Generation of an accurate Facialground Truth for Stereo Algorithm Evaluation

Our experiment evaluates the respective performance of stereo algorithms on a set of stereo-pairs and associated ground truth disparity map generated from a 3D face surface. Contrary to expectations, there is seemingly very little difference between the performances of many currently known stereo algorithms in the case of human face reconstruction, although more complex methods (such as graph-c...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006